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ABSTRACT 


The  Data  Center  employs  the  IBM  1440  in  the  search  and  retrieval 
of  its  data  files.  The  files  are  maintained  on  EAM  cards  but  arc  to  be 
converted  to  disk  during  a  search.  The  Center  formerly  utilized  the 
IBM  101  Statistical  Machine  and  the  IBM  870  Document  Writing  System. 
The  Center  has  data  relating  to  more  than  1/2  million  material  tests  of 
metals  and  plastics.  Use  of  the  computer  has  permitted  the  Center  to 
cope  with  the  increasing  demand  on  its  services  without  increasing  its 
budget  by  the  same  order  of  magnitude. 
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I.  SUMMARY 


I.  SUMMARY 


The  Mechanical  Properties  Data  Center  employs  mechanized 
processes  in  the  storage  and  retrieval  of  its  data  files  which  consist  of 
the  mechanical  properties  of  metals  and  plastics.  Data  are  stored  on 
EAM  punched  cards  and  searched  using  the  Center's  IBM  1440  computer 
which  was  installed  in  late  1965.  Prior  to  that  time,  processing  was 
done  with  EAM  techniques  based  upon  the  IBM  870  Document  Writer- 
system  (foi  output  formatting)  and  the  IBM  101  Statistical  Machine 
(for  retrieval  by  selective  file  searching). 

The  Center  has  stored  data  relating  to  more  than  a  half  million 
material  tests  which  is  currently  growing  at  the  rate  of  about  8,  000 
records  per  month.  The  major  source  of  this  information  is  in  documents 
from  DDC.  In  addition,  original  test  records  are  received  from  some 
companies.  The  Center's  Director  considers  a  test  lab  report  to  be 
the  ideal  form  in  which  to  receive  information.  At  present  no  attempt 
is  made  to  include  classified  material. 

The  services  of  the  Mechanical  Properties  Data  Center  are 
available  free  of  charge  to  DoD,  NASA,  and  their  contractors.  More 
that  (50  queries  are  answered  in  a  typical  month.  The  largest  number 
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handled  in  a  single  month  was  ■  A  typical  search  might  produce 
300  specimen  tests  from  six  different  references  and  10  references  cited 
to  documents  that  discuss  the  subject,  but  contain  no  data.  Requests 
for  information  may  be  received  by  telephone,  TWX,  telegraph,  or  mail. 

Figure  1  illustrates  the  flow  of  information  to  the  Center. 
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MECHANIZATION 


II.  MECHANIZATION 


1.  CHRONOLOGY 

The  Center  was  started  in  June  195  9  with  the  award  of  a  contract 
by  the  Air  Force's  Directorate  of  Materials  and  Processes,  Aeronautical 
Systems  Division.  From  the  beginning,  EAM  punched  cards  were 
chosen  as  the  storage  medium.  Retrieval  of  information  and  processing 
was  done  using  electronic  accounting  machines,  particularly  the  IBM 
870  system  and  the  IBM  101  Electronic  Statistical  Machine. 

This  arrangement  was  preferred  to  a  computer  because  of  problems 
associated  with  obtaining  computer  time  and  because  of  the  difficulty  of 
refining  a  search  in  mid-process  through  the  data  file.  These  disadvan¬ 
tages  were  considered  greater  than  the  limitation  on  the  number  of  fields 
searchable  per  pass  imposed  by  the  IBM  101. 

In  19G5,  after  studying  available  systems,  the  Center  acquired 
an  IBM  1440  computer  and  developed  programs  to  perform  the  searching 
and  listing  formerly  done  with  the  EAM  equipment.  No  change,  however, 
was  made  in  the  data  storage  format.  The  Center  decided  on  the 
computer  because  they  felt  the  level  of  work  (both  for  the  Center  and 

-4- 


for  Belfour-Stulen's  other  activities)  justified  the  cost,  and  because 
the  1440  had  the  special  feature  of  random  access  disk  files  that  are 
easily  changed. 

Another  reason  for  acquiring  the  computer  was  to  lower  the  unit 
cost  of  searches,  particularly  to  keep  costs  within  the  Center's  budget 
Since  1964,  search  requests  have  increased  from  an  average  of  12  per 
month  to  60  per  month  (peak  was  90  per  month). 

2.  DESCRIPTION  OF  PROCESSES 
(1)  Input  Procedures 

The  primary  source  of  information  for  the  Center  is 
documented  numerical  data  from  tests  such  as  tensile  tests, 
compression  tests,  fracture  toughness  tests,  and  creep  tests. 
These  data  arc  entered  on  EAM  punched  cards  and  filed  by  test 
type  within  material  type.  The  process  is  illustrated  in  Figure 
and  has  the  following  procedure: 


1.  A  document  which  includes  test  results  is  selected, 
checked  to  see  if  it  is  already  in  the  files,  and  then 
ordered.  At  this  time  a  title  card  for  the  document  is 
prepared  manually. 

2.  When  the  document  is  received,  it  is  matched 
against  the  title  card,  and  checked  again  for  duplication. 
(This  step  is  currently  a  manual  process,  but  will  be 
eventually  converted  to  a  computer  process.  ) 


FIGURE  2 


3.  An  accession  number  is  assigned  to  the  document. 

This  is  a  live -digit  number  for  a  document  containing 
information  that  directly  relates  to  the  collection.  A 
four-digit  number  is  assigned  in  the  case  of  a  document 
of  a  more  general  nature. 

4.  At  this  point,  a  preliminary  indexing  takes  place  in 
order  to  provide  a  means  of  retrieving  the  document's 
information  before  the  document  has  been  selected  from  the 
backlog  and  fully  processed.  This  indexing  will  permit 
retrieval  by  means  of  one  or  more  of  the  subject  words 
most  frequently  mentioned  in  requests;  e.  g.  ,  alloy,  property 
or  test  conditions.  The  information  encoded  and  keypunched 
is  material  identification,  test  types,  test  conditions,  title 
and  author. 

5.  Documents  are  filed  in  numerical  accession  number 
order.  They  are  selected  for  encoding  into  the  data  storage 
on  a  basis  of  subjects  which  are  of  greatest  interest  at  the 
moment,  or  subjects  in  which  the  Center's  stored  data 
appears  to  be  low. 

6.  A  document  containing  information  selected  for 
routine  processing  goes  to  an  encoder  who  extracts  the 
information  to  be  used  and  records  it  on  encoding  forms. 
Numerical  data,  such  as  actual  mechanical  properties, 
test  temperature,  etc.  are  directly  keypunched  (except  for 
rounding  off  to  the  significant  places  consistent  with  desired 
accuracy)  without  encoding.  Other  information  is  assigned 
numeric  or  alphabetic  symbols  derived  from  a  prepared 
code  book.  The  code  book  is  not  a  rigid  thesaurus,  and 
new  codes  are  entered  whenever  necessary.  Appendix  A 
illustrates  several  of  the  work  sheets  which  are  used  to 
format  the  information  for  keypunching. 

There  are  two  basic  formats,  A  and  B.  The  A  format 
consists  of  the  final  test  results  together  w'ith  a  relatively 
complete  summary  of  the  material  condition,  manufacturing 
processes,  test  specimen  configuration,  specimen  condition¬ 
ing,  and  testing  environments.  All  of  this  A  format  informa¬ 
tion  is  keypunched  in  a  single  card  for  each  test.  In  most 
searches,  this  card  is  all  that  need  be  retrieved  to  obtain 
the  principal  information.  Two  examples  of  the  A  format 
worksheets  are  shown  in  Appendices  A-l  and  A -2.  One  is 
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labeled  Formal  MTC-A  for  metals,  tensile-creep  test, 

A  format.  The  other  is  labeled  MBT-A  for  metals, 
bending  fracture  toughness  test,  A  format. 

The  B  format  contains  more  information  such  as 
material  composition  and  heat  treatment.  These  cards 
are  filed  separately  from  the  A  ca’  ds  and  are  related  by 
a  common  serial  number,  which  is  a  composite  of  the 
document  accession  number,  the  unit,  set  and  specimen 
number.  The  number  is  discrete  within  test  type  (card 
columns  1-3),  and  by  adding  the  test  type  code  (columns  4, 

5),  it  becomes  unique. 

The  B  format  is  divided  into  two  types  of  cards, 

B  and  B^,  the  worksheets  for  which  are  shown  in 
Appendices  A-3  and  A-4. 

One  A  type  card  will  normally  exist  for  each  material 
specimen  and  one  B  type  card  for  each  unit  of  material. 
Furthermore,  the  B  type  express  only  the  basic  material 
and  are  independent  of  test  types.  Thus,  the  same  B  cards 
will  normally  exist  for  several  A  cards. 

In  addition  to  the  A  and  B  formats,  there  are  supple¬ 
mental  card  formats  referred  to  as  A  cards  and  an 
"encoder's  card.  "  There  is  also  a  format  variation 
specifically  for  fatigue  tests  (which  is  not  discussed).  The 
former  are  supplemental  to  the  basic  A  card;  e.  g.  ,  A^  is 
the  first  supplement  to  its  A  card,  A^  is  the  second,  etc. 

Note  that  each  of  these  A  's  relate  to  a  specific  A  card  and 
to  no  other.  The  "encoder's  card"  is  an  8-1/2"  x  11"  form 
which  is  generated  for  each  document  to  contain  significant 
information  not  covered  in  the  standard  format.  The  A  and 
encoders  formats  are  shown  in  Appendices  A -5  and  A-f/) 

7.  The  encoded  information  is  edited  to  catch  obvious 
errors  and  interpretative  discrepancies.  The  information 
is  then  keypunched  on  EAM  cards  and  verified. 

8,  The  cards  prepared  for  the  document  are  placed  in  a 
storage  drawer  at  the  keypunch  station.  When  8,  000  to 
10,000  cards  arc  accumulated,  a  machine  edit  is  made.  The 
cards  arc  sequenced  by  material  and  added  to  the  already 
existing  file  The  inventory  is  modified,  when  necessary. 

by  putting  in  delete  cards. 
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FIGURE  3 


Retrieval  Proces 


In  practice,  the  center  maintains  three  files.  Since 
questions  come  in  by  materials  type,  one  file  is  organized 
by  materials  type.  The  second  file  is  organized  by  test 
type  within  materials  type.  The  third  file  is  completely 
random.  Which  file  is  searched  depends  upon  the  charac¬ 
teristics  of  a  query. 

(2)  Query  Preparation  and  Retrieval  Outputs  (See  Figure  3) 

1.  A  request  for  information  is  refined  and  initially 
recorded  on  the  Search  Request  Form  illustrated  in  13-1. 

The  codes  for  the  various  parameters  are  determined  and 
the  request  is  redefined  on  the  coded  search  form  illustrated 
in  B-2.  The  system  will  accommodate  a  maximum  of  10  OR 
conditions  and  5  AND  conditions  within  each  of  these  OR 
conditions.  (A  NOT  parameter  is  indicated  by  a  flag  in  the 
NOT  column  of  a  particular  AND  condition. ) 

2.  Attribute  cards  are  then  keypunched  with  the  coded 
search  parameters. 

3.  Either  pertinent  data  cards  are  selected  from  the 
materials  or  test  type  files,  or  the  first  group  of  cards  to 

be  searched  is  obtained  from  the  random  file.  The  computer 
then  searches  the  data  cards  for  the  desired  parameters 
based  on  the  material  field  (card  column  5)  and  the  test  type 
field  (card  column  4)  using  the  attribute  cards  to  define  the 
desired  information.  The  printout  of  the  first  rough  sort  is 
referred  to  as  the  "blast -out".  An  example  may  indicate 
that  further  refining  or  narrowing  of  the  search  is  necessary. 
The  search  is  run  again  using  deletion  cards  to  remove  un- 
desired  information  and  format  heading  cards  to  obtain  the 
desired  output  form  in  the  final  printout.  An  example  of  a 
final  printout  is  shown  in  Appendix  C.  The  Center  is  equipped 
with  a  card-to-graph  x-y  plotter  which,  along  with  the  computer, 
provides  the  Center  a  capability  of  producing  data  tabulations, 
listings,  graphic  displays  and  other  pre-programmed  print¬ 
outs. 

4.  From  time  to  time,  personnel  at  the  Center  retrieve 
the  entire  inventory  of  a  particular  subject  test  type.  This 
is  published  and  distributed  to  members  of  the  aerospace 
and  defense  industries  as  a  means  of  acquainting  them  with 
the  resources  of  the  Mechanical  Properties  Data  Center. 
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3. 


ACTIVITIES  BEING. PLANNED  OH  DEVELOPED  FOR 
MECHANIZATION 

A  KW1C  index  of  document  titles  may  be  produced  for  in-house 
use.  The  KW1C  index  would  be  used  as  a  tool  for  locating  the  contents 
of  documents  before  indexing  them  in  depth  and  as  a  duplicate -checking 

device.  It  would  also  give,  as  a  by-product,  some  thesaurus  information.  | 

j 

! 

All  of  the  data  are  now  stored  on  punched  cards.  The  Center  will  : 

probably  not  convert  these  to  tape  files  because  of  the  relative  ease  of 
updating  the  cards  and  the  low  rate  of  use  of  many  of  them.  Converting 
to  tape,  however,  may  be  considered  if  the  search  rate  exceeds  about 
30  per  day.  Certain  groups  of  cards  may  also  be  converted  to  tape  to 
create  a  protablc  file.  Another  possibility  being  considered  is  to  put 
card  groups  on  disks  in  order  to  permit  long  Boolean-type  searches. 
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III.  PROGRAM  SYSTEM  DATA 


The  IBM  1440  Computer  is  used  to  perform  data  search  and 
retrieval  processes  on  the  Center's  data  card  file.  This  activity 
has  been  done,  until  r-^cntly,  semi-manually  using  EAM  equipment 
centered  around  the  IBM  101  Statistical  Machine  and  the  IBM  870 
Document  Writing  System. 

1.  MAJOR  FILES 

(1)  Data  Card  File 

There  are  three  basic  types  of  EAM  cards  (denoted  by 
A,  A^,  and  13)  used  to  record  the  data.  The  formats  are  fixed 
and  are  described  in  Appendix  A.  The  cards  are  filed  manually 
by  materials  class  (column  7),  and  within  alloys  by  test  type 
(column  4).  Related  cards  are  linked  by  a  reference  number 
(column  1-3)  which  is  discrete  within  a  test  type,  and  a  test 
type  (column  4)  which,  taken  with  the  reference  number,  form 
a  unique  code. 

(2)  Master  Code  Disk  Pack 

The  format  of  this  disk  file  is  shown  in  D-l.  The  file 
is  used  as  a  code  dictionary  that  relate  card  types  and  attributes 
to  numerical  codes. 
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2. 


ROUTINES 


System  flow  diagrams  are  shown  in  Appendices  D- 2  through  D-5. 
These  are  described  as  they  appear  in  the  retrieval  process  in  the 
following  paragraphs: 

( ] )  Attribute  Cards  to  Disk 

This  routine  is  illustrated  in  D-2.  An  attribute  card, 
containing  the  codes  to  be  later  used  in  searching  the  data  cards, 
is  read  and  the  first  attribute  is  placed  in  main  memory.  Next 
the  Master  Code  Disk  Pack  is  searched  for  the  code  strings  that 
relate  to  the  attribute.  These  are  then  added  to  the  main  memory. 
This  process  is  iterated  for  all  attributes. 

( 2)  Data  Card  to  Disk 

D-3  illustrates  the  routine.  To  begin,  the  Master  Code 
Disk  Pack  is  replaced  by  a  Scratch  Disk  Pack.  The  first  data 
card  is  then  read  and  tested  for  the  desired  attribute  codes.  As 
these  are  located,  the  attribute  code  is  flagged  with  word  marks 
in  the  main  memory  and  the  corresponding  data  are  written  on  the 
scratch  disk.  This  process  is  iterated  until  all  data  cards  are 
read. 
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(3)  End  of  Job  Routine 


This  routine  is  illustrated  in  Appendix  D-4.  The  last 
sector  of  the  scratch  disk  that  is  used  is  recorded  on  the 
trailer  sector  of  the  scratch  disk.  The  scratch  disk  now 
consists  of  all  of  the  desired  attribute  codes  with  word  marks 
flagging  those  which  were  encountered  on  the  data  cards.  The 
scratch  disk  is  then  sorted,  becoming  the  Organized  Work  Pack. 

(4)  Blast  Out  Routine 

This  routine  is  illustrated  in  Appendix  D-5.  The  Organized 
Work  Pack  is  then  run  with  heading  cards,  and  the  Master  Code 
Pack,  and  printed  out.  The  heading  cards  identify  the  output 
format  and  the  Master  Code  Pack  supplies  the  alphabetic  descrip¬ 
tions  corresponding  to  the  attribute  codes. 

( 5 )  Pina  1  Report  Routine 

This  routine  is  shown  in  Appendix  D-6.  Following  a  review 
by  the  project  engineer,  the  desired  data  deletions  and  format 
instructions  are  keypunched.  These  are  run  with  the  final  report 
printout  and  x-y  plotting  cards  (if  needed). 
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IV.  EQUIPMENT,  COSTS,  AND  EVALUATIONS 


1.  EQUIPMENT 

IBM  1440  Computer: 

Central  Processor  with  8I<  character  core  memory 

2-Model  131  Disk  Storage  Drives 

(Each  disk  file  .has  20,  000  addressable  locations, 

100  characters  per  address,  7  bits  per  character. ) 

Model  1442  Card  Read  Punch  (400  cpm) 

-  Model  1403  High  Speed  Printer  (340  lines  per  minute) 


EAM  Equipment: 

02G  Card  Punch 

047  Tape  to  card  converter 

083  Sorter  with  sort  suppression 

05G  Verifier 

05  3  Collator 


IBM  870  Document  Writing  System: 

Mos.cly  X-Y  Plotter  with  model  GOA  card  translator  connected 
to  IBM  514  Reproducing  Punch.  (Reads  G  card  columns  per 
data  print  at  50  cards  per  minute.) 
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The  following  ISAM  equipments  are  to  be  eliminated  with  the 


introduction  of  the  1440  computer; 

101  Electronic  Statistical  Machine 

602  Calculating  Punch 

834  Control  Unit 

8G6  Non  Trans.  Typewriter 

2.  COSTS 

Equipment  Monthly  Rental: 

1440  Computer  including  processor,  disk  control, 
console,  etc.  -  $1,800 

1442  Card  Read  Punch  -  $445 

1403  Printer  with  1446  Control  -  $1,040 

1311  Disk  Storage  Drives  (2  units)  -  $745 

083  Sorter  with  Sort-Suppression  -  $112 

0  2G  Printing  Key  Punch  -  $60 

101  Electronic  Statistical  Machine  -  $275 

602  Calculating  Punch  -  $245 

834  Control  Unit  -  $124 

8GG  Non  Trans.  Typewriter  - 


$35 


Development: 

The  Center  has;  been  developing  its  system  of  data  storage 
and  retrieval  since  its  establishment  in  June,  1959.  The  recent 
computer  program  development  was  performed  by  the  Center's 
full  time  program  systems  engineer  who  has  spent  approximately 
2,  000  man-hours  developing  it. 

3.  FACILITY'S  EVALUATION 

Experience  has  indicated  to  the  Center  that  the  cost  equivalence 
point  between  computer  and  earn  searching  is  about  three  to  four 
searches  per  day.  Beyond  this  rate,  the  EAM  system  becomes  increas¬ 
ingly  cumbersome  to  the  point  of  impracticability. 

The  cost  of  an  individual  search  is  not  directly  related  to  the 
number  of  cards  involved.  Output  form  varies,  and  Data  Center 
personnel  review  displays  and  references  for  each  search  in  varying 
depth,  depending  on  user  requirements. 

Both  tht>  previous  EAM  system  and,  to  a  greater  extent,  the 
present  computer  system  permit  the  manipulation  of  data  taken  from 
many  sources  into  various  formats  using  a  variety  of  test  parameters. 
This  flexibility  in  relating  data  gives  the  Center  a  capability  of  pseudo¬ 
test  ing. 


-17- 


Earlier  computers  were  not  suitable  for  the  Center  because  of 
the  long  linear  tape  files  and  corresponding  long  search  time  that  would 
have  been  requ.ved.  The  present  computer  has  a  random -access,  high 
capacity  disc  memory  which  avoids  this  problem  and  has  the  additional 
advantage  of  simple  disc  file  changing. 

The  various  outputs  of  the  Center  have  been  intentionally  restricted 
to  requested  information  and  a  few  published  reports  which  are  for  the 
purpose  of  publicizing  and  stimulating  the  use  of  the  Center.  This 
restriction  is  due  to  the  Cenler's  policy  "not  to  contribute  to  the  volume 
of  published  literature  competing  for  the  attention  of  the  technical 
community". 
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